Validation of phonetic transcriptions in the context of automatic speech recognition

نویسندگان

  • Christophe Van Bael
  • Henk van den Heuvel
  • Helmer Strik
چکیده

Some of the speech databases and large spoken language corpora that have been collected during the last fifteen years have been (at least partly) annotated with a broad phonetic transcription. Such phonetic transcriptions are often validated in terms of their resemblance to a handcrafted reference transcription. However, there are at least two methodological issues questioning this validation method. Firstly, no reference transcription can fully represent the phonetic truth. This calls into question the status of such a transcription as a single reference for the quality of other phonetic transcriptions. Secondly, phonetic transcriptions are often generated to serve various purposes, none of which are considered when the transcriptions are compared to a reference transcription that was not made with the same purpose in mind. Since phonetic transcriptions are often used for the development of automatic speech recognition (ASR) systems, and since the relationship between ASR performance and a transcription's resemblance to a reference transcription does not seem to be straightforward, we verified whether phonetic transcriptions that are to be used for ASR development can be justifiably validated in terms of their similarity to a purpose-independent reference transcription. To this end, we validated canonical representations and manually verified broad phonetic transcriptions of read speech and spontaneous telephone dialogues in terms of their resemblance to a handcrafted reference transcription on the one hand, and in terms of their suitability for ASR development on the other hand. Whereas the manually verified phonetic transcriptions resembled the reference transcription much closer than the canonical representations, the use of both transcription types yielded similar recognition results. The difference between the outcomes of the two validation methods has two implications. First, ASR developers can save themselves the effort of collecting expensive reference transcriptions in order to validate phonetic transcriptions of speech databases or spoken language corpora. Second, phonetic transcriptions should preferably be validated in terms of the application they will serve because a higher resemblance to a purpose-independent reference transcription is no guarantee for a transcription to be better suited for ASR development.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Application-oriented validation o preliminary r

There is an increasing need for automatic procedures to generate and validate phonetic transcriptions. As the production of manual phonetic transcriptions tends to be time-consuming, error-prone and costly, procedures have been developed to derive phonetic transcriptions automatically by means of automatic speech recognition technology. Such automatic phonetic transcriptions are usually validat...

متن کامل

A pplication-orien ted validation o f phonetic transcriptions: prelim inary results

There is an increasing need for automatic procedures to generate and validate phonetic transcriptions. As the production of manual phonetic transcriptions tends to be time-consuming, error-prone and costly, procedures have been developed to derive phonetic transcriptions automatically by means of automatic speech recogni­ tion technology. Such automatic phonetic transcrip­ tions are usually val...

متن کامل

Validation of phonetic transcriptions based on recognition performance

In fundamental linguistic as well as in speech technology re­ search there is an increasing need for procedures to automat­ ically generate and validate phonetic transcriptions. Whereas much research has already focussed on the automatic genera­ tion o f phonetic transcriptions, far less attention has been paid to the validation of such transcriptions. In the little research performed in this a...

متن کامل

Automatic derivation of multiple variants of phonetic transcriptions from acoustic signals

This paper deals with two methods for automatically finding multiple phonetic transcriptions of words, given sample utterances of the words and an inventory of context-dependent subword units. The two approaches investigated are based on an analysis of theN -best phonetic decoding of the available utterances. In the set of transcriptions resulting from theN -best decoding of all the utterances,...

متن کامل

Validation and improvement of automatic phonetic transcriptions

The ultimate aim of our research is to show that good-quality phonetic transcriptions of large speech corpora can be obtained by employing automatic techniques initially developed for ASR. The experiment presented in this paper has two aims. The first is to show how the quality of an automatic transcription that is easily obtained through lexicon lookup can be measured in a way that is methodol...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Language Resources and Evaluation

دوره 41  شماره 

صفحات  -

تاریخ انتشار 2007